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ABSTRACT 

We present an analysis of the evolution of galaxy clustering in the redshift 
interval < z < 4.5 in the HDF-South. The HST optical data are combined 
with infrared ISAAC/VLT observations, and photometric redshifts are used 
for all the galaxies brighter than Iab < 27.5. The clustering signal is ob- 
tained in different redshift bins using two different approaches: a standard 
one, which uses the best redshift estimate of each object, and a second one, 
which takes into account the redshift probability function of each object. This 
second method makes it possible to improve the information in the redshift 
intervals where contamination from objects with insecure redshifts is impor- 
tant. With both methods, we find that the clustering strength up to z ~ 3.5 
in the HDF-South is consistent with the previous results in the HDF-North. 
While at redshift lower than z ~ 1 the HDF galaxy population is un/anti- 
biased (b < 1) with respect to the underlying dark matter, at high redshift 
the bias increases up to b(z ~ 3) ~ 2 — 3, depending on the cosmological 
model. These results support previous claims that, at high redshift, galaxies 
are preferentially located in massive haloes, as predicted by the biased galaxy 
formation scenario. In order to quantify the impact of cosmic errors on our 
analyses, we have used analytical expressions from Bernstein (1994). Once the 
behaviour of higher-order moments is assumed, our results show that errors 
in the clustering measurements in the HDF surveys are indeed dominated by 
pure shot-noise in most regimes, as assumed in our analysis. We also show that 
future observations with instruments like the Advanced Camera on HST will 
improve the signal-to-noise ratio by at least a factor of two; as a consequence, 
more detailed analyses of the errors will be required. In fact, pure shot-noise 
will give a smaller contribution with respect to other sources of errors, such 
as finite volume effects or non-Poissonian discreteness effects. 

Key words: cosmology: observations - photometric redshifts - large-scale 
structure of Universe - cosmic errors - galaxies: formation - evolution - haloes 
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1 INTRODUCTION 

It is well known that the evolution of the dark matter clustering can be reliably used to put strong constraints on 
cosmological models. In fact the growth of density fluctuations depends on the main cosmological parameters, namely 
the contribution of matter and cosmological constant to the present total energy density (Qom and Qoa , respectively) . 
This result, confirmed by high-resolution N-body simulations (e.g. Jenkins et al. 1998), has been used to build a 
semi-empirical model which suitably relates the linear perturbation scale to the final non-linear scale of the same 
perturbation after collapse (Hamilton et al. 1991). This technique can be used to compute analytically the evolved 
correlation function starting from a given primordial density power-spectrum (e.g. Peacock & Dodds 1994, 1996; Jain, 
Mo & White 1995). 

However, the application of this idea to real data is greatly complicated by the fact that the observed objects 
(galaxies, quasars, clusters, etc.) are not direct tracers of the dark matter distribution. Usually, the ignorance about 
the relation between the object density, S , and the dark matter one, <5 m , is parametrized introducing the so-called 
bias parameter b, for which a simple linear relation is a common assumption: b = S /S m (Kaiser 1984). Note that this 
relation includes the details of structure formation and, as a consequence, is quite uncertain. 

A possible shortcut to the solution of this problem is to relate the value of b to some intrinsic property. For 
example, analytical models (e.g. Mo & White 1996; Catelan et al. 1998; Jing 1999; Sheth & Tormen 1999; Sheth, Mo 
& Tormen 2001), confirmed by the results of N-body simulations, suggest that the bias factor of dark matter haloes 
is a function only of their mass and formation redshift (apart from the cosmological parameters). If there is a way 
to relate a typical observational quantity of the considered objects (such as flux or luminosity) directly to the mass 
of their hosting dark matter haloes, the study of the clustering evolution fully recovers its ability to discriminate 
between different cosmological models. For instance, in the case of galaxy clusters detected in the X-ray band, the 
flux at a given redshift corresponds to a given halo mass, under the assumptions of virial isothermal gas distribution 
and spherical collapse. Hydrodynamical simulations confirm the resulting relations between mass and luminosity or 
temperature, even if with a large scatter. Then the comparison of the observed cluster two-point correlation function to 
theoretical predictions can be used to put some constraints to the cosmological parameters. For example, Moscardini 
et al. (2000a,b) find that the clustering properties of the clusters observed in different samples (RASS1 Bright Sample, 
XBACs, BCS and REFLEX) favour cosmological models with a low value of f2om- 

In the case of galaxies, applying a similar technique is much more difficult, mainly because the relation between 
mass and luminosity is not one-to-one. Moreover, it is not clear how many galaxies can occupy a single halo of a 
given mass. However, once a cosmological framework is fixed, the study of the clustering evolution of galaxies can be 
used to obtain more information about the nature of these objects. For example, it is possible to estimate a typical 
value for the mass of the dark matter haloes hosting the galaxies. Moreover the clustering data can be used to discuss 
if the merging process is important at various redshifts or if the galaxy number tends to be conserved during the 
evolution. In fact, these two opposite models predict a completely different redshift evolution of the bias factor (see 
e.g. Matarrese et al. 1997 and Moscardini et al. 1998). 

From the point of view of the observational data required for this kind of study, enormous progress has been 
made in recent years. Large spectroscopic surveys gave an accurate description of the spatial distribution of the 
galaxies in the local Universe. Statistical analyses have shown that the correlation length depends on morphological 
type and/or absolute magnitude: more luminous and/or early- type galaxies appear to have higher clustering than 
faint and/or late-type galaxies (Santiago & da Costa 1990; Loveday et al. 1995; Benoist et al. 1996; Norberg et al., 
2001). However, these local observations can be reasonably well reproduced by a large variety of sensible cosmological 
models, while possible differences are expected at higher redshifts, as previously discussed. This has been one of the 
main reasons which motivated the extension of spectroscopic surveys to high redshifts. Nowadays, different samples 
are available to estimate the clustering properties of galaxies up to z « 1 [Canada- France Redshift survey (Le Fevre et 
al. 1996); Hawaii K survey (Carlberg et al. 1997); Norris Redshift Survey (Small et al. 1999); Caltech Faint Redshift 
survey (Hogg, Cohen & Blandford 2000) ; Canadian Network for Observational Cosmology field galaxy redshift survey 
(Carlberg et al. 2000)]. Even if the sampled regions are relatively small, the results are in good agreement in showing 
a decline of the correlation length with redshift. 

Up to a few years ago, clustering studies at higher redshifts were limited to peculiar objects like radiogalaxies 
or quasars. The discovery of reliable colour techniques (U-dropouts) made it possible to identify a large sample of 
'normal' galaxies at z ~ 3, the so-called Lyman-Break galaxies (LBGs). By measuring the correlation function or 
computing the count-in cell statistics, different works (Adelberger et al. 1998; Giavalisco et al. 1998; Giavalisco & 
Dickinson 2000) showed that LBGs have a correlation length at least comparable with that of present-day spiral 
galaxies. This result corresponds to quite a high value for the bias factor at z ~ 3, suggesting that their formation 
occurs in massive dark-matter haloes. 

An alternative way to probe larger volumes and/or fainter galaxy populations makes use of the photometric 
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redshift technique (e.g. Lanzetta, Yahil & Fernandez-Soto 1996; Sawicki, Lin & Yee 1997; Arnouts et al. 1999, 
hereafter A99; Bolzonella, Miralles & Pello 2000). This method, based on the comparison of theoretical and/or real 
spectra with the observed galaxy colours in different bands, makes it possible to estimate their redshifts at higher 
magnitudes than those reached spectroscopically by the largest available telescopes. This is done in a probabilistic 
way; as a consequence, the estimates are affected by errors, which typically have been found to increase with redshift. 
Note that to date, these inherent uncertainties in the redshift estimates were completely ignored or estimated via 
simulations and used as an a posteriori global correction to the correlation measurements (A99). A more correct 
approach would require the estimate of the redshift uncertainty for each object and the inclusion of this information 
in the computation of the correlation function, as discussed in this paper. 

Thanks to the application of the technique of photometric redshifts to its very deep observations, the Hubble 
Deep Field (HDF) North (Williams et al. 1996) has become a test case for the evolution of the galaxy distribution. 
The data of more than one thousand objects down to Iab ~ 28.5 have been used to study the redshift evolution of 
the clustering up to z ~ 4.5 (A99; Magliocchetti & Maddox 1999; Roukema et al. 1999; see also the analysis made 
by Connolly, Szalay & Brunner 1998 up to z ~ 1.2). The results show that the comoving correlation length, after a 
small decrease in the interval 0^ z^z 1, increases up to z ~ 4. It is worthwhile to stress that the term "evolution" 
has not to be taken literally. Given a survey defined by its characteristic limiting magnitude and surface brightness, 
the galaxies observed at high z typically have higher luminosities. Therefore, the intrinsic differences of the galaxy 
properties at different z can mimic an evolution, i.e. the evolution measured in a flux-limited survey is not only due to 
the evolution of a unique population but can be due to a change of the considered population. In fact the theoretical 
modelling of the HDF galaxies shows that to reproduce their clustering properties at different redshifts, the mean 
mass of dark matter haloes hosting the galaxies is required to increase with z (A99). 

The reliability of the previous results, however, can be affected by the smallness of the observed field. In particular, 
it is not clear to what extent a region of a few square arcminutes can be considered representative of the properties of 
the whole Universe. The data more recently obtained in the HDF-South (Casertano 2000) offer a unique opportunity to 
test the robustness of HDF-North results, due to their mutual independence. For example, the field-to-field variations 
can be used to estimate the size of the cosmic variance on these scales. The main goal of this paper is to study in 
detail the clustering properties of HDF-South and to compare them with those obtained for the northern field to 
confirm or disprove the general picture described above. 

The paper is organized as follows: In Section 2 we present the photometric database used in this analysis and 
briefly describe the photometric redshift technique. In Section 3 we introduce the two methods used to estimate the 
angular correlation function: the standard approach and an alternative method taking into account the photometric 
redshift uncertainties. Still in Section 3 we present the results of this analysis and estimate the bias factor. Section 4 is 
devoted to a theoretical discussion of the cosmic errors in the clustering estimates in Hubble Deep Fields. Conclusions 
are presented in Section 6. 

2 THE CATALOGUE AND PHOTOMETRIC REDSHIFTS 

2.1 The data 

Deep high-resolution optical dataset (F300, -F450, ^606 and F814) from HST and deep infrared observations have been 
combined. The IR observations have been carried out in J S ,H,K S passbands with the ISAAC instrument on the 
VLT (UT2) during the period July-September 1999. The total integration times are 7h, 6h and 8h in J, H and 
K a , respectively. The final coadded images have a seeing of 0.6 arcsec in J S ,H,K S . The Vega magnitude limits in 
IFWHMs at 5a level are 24,23,22.5 in J S ,H,K S , respectively (Saracco et al. 2001). 

The photometric catalogue containing the optical and infrared colours is described in detail in Vanzella et al. 
(2001). We recall here that the detections are based on the summed V + I images and the deblending process has 
been tuned and optimized in order to obtain a photometric catalogue particularly reliable for photometric redshifts. 
Indeed a modified version of the SExtractor software (Bertin & Arnouts 1996) has been applied to optimize the 
SExtractor parameters (namely deblend-mincont, detect-minarea) in different regions of the frame. This procedure 
allows to improve the deblending of close pairs as well as to keep in single units large spiral galaxies and affects only 
the very small angular scales (8 < 3arcsec). A catalogue of 1474 sources has been extracted up to Iab — 28.5. 

2.2 The photometric redshift measurement 

The technique of photometric redshifts adopted in this paper has been described in more detail in A99. The technique 
is based on \ 2 minimisation which compares the observed magnitudes to the GISSEL96 synthetic library (Bruzual 
& Chariot 1993). In order to quantify the redshift uncertainties, in Figure^ we compare, for galaxies accessible to 
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Figure 1. Dispersion between the spectroscopic redshifts and photometric estimates in the HDF-North (146 spectra) and 
HDF-South (24 spectra) (see text). The redshift dispersion (<r z ) is obtained by using a 3<r-clipping rejection for two samples: 
z< 1.5 and z > 1.5. Catastrophic redshifts (represented by square symbols) have not been used in the measurement. Rejected 
objects during the cr-clipping are shown with open circle symbols. The solid line corresponds to Az = and the long-dashed 
lines to Az = 0.5. 

spectroscopy, the spectroscopic redshifts and those obtained using the photometric technique. The HDF-North sample 
is based on the list of Cohen et al. (2000) which is composed of 146 spectra. The HDF-South sample is based on 
22 spectra from the list of Cristiani et al. (1999) observed with the VLT telescope and from Dennefeld et al. (2001) 
observed with NTT telescope. We also add 2 spectra observed with the Anglo Australian Telescope (Glazebrook et 
al., 1998). In the area of WFPC2 the HDF-South sample consists of 24 spectra, two of which are at z spcc > 1.5. The 
redshift accuracy is defined as in Fernandez-Soto, Lanzetta & Yahil (1999): (^spcc — ^phot )/(l + z spcc ), from which 
we extract the mean (Az) and the dispersion (cr z ) by using a a-clipping algorithm at 3<r rejection level. We obtain 
<r z = 0.05 and Az = 0.03 for z spC c < 1.5 and o~ z = 0.05 and Az = 0.02 for z spcc > 1.5. Two catastrophic redshifts 
were initially rejected from the statistics (shown by large open squares in Figure |l|) and six objects at z < 1.5 and 
two objects at z > 1.5 were rejected during the cr-clipping process (open circles in Figure |l|). The total number of 
rejected objects is 10/170, corresponding to 6 per cent. 

In Figure ^ we compare the redshift distributions obtained for the HDF-North and HDF-South for two intervals 
of magnitude, Iab < 26 and 26 < Iab < 27.5 (upper and lower panels, respectively). The two redshift distributions 
are similar. The Kolmogorov-Smirnov (KS) two-tail statistics does not reject the null hypothesis that the redshift 
distributions in the HDF-North and South are drawn from the same parent population. The KS-probability of the 
null hypothesis turns out to be 0.12 and 0.20 for the samples with Iab < 26 and 26 < Iab < 27.5 respectively. The 
median redshift in the HDF-North seems to be slightly higher for the bright sample, which is not surprising due to 
the presence of large-scale structures at z ~ 1 in the HDF-North (Cohen et al., 2000), also evidenced by systematic 
color differences (Vanzella et al., 2001). 



3 THE ANGULAR CORRELATION FUNCTION 
3.1 Selection of the sample 

To compute the angular correlation function (ACF), we have limited our analysis to the region of the HDF-South with 
the highest signal-to-noise, excluding the area of the PC and the outer part of the three WFPC. The details of how 
the HDF-North and HDF-South photometric catalogues (Fernandez-Soto et al. 1999; Vanzella et al. 2001) have been 
constructed are slightly different and the scale of the total / magnitudes may present systematic differences. This is 
especially true at faint magnitudes. For this reason, rather than applying formally equal magnitude limits to the two 
samples, we prefer to adopt for HDF-South a limit that defines a roughly equal number of sources as in HDF-North 
and a comparable number of objects in each redshift interval (at least 100 except for the range 3.5 < z < 4.5). This 
can be accomplished by selecting in the HDF-South catalogue all galaxies brighter than Iab — 27.5. The total number 
of objects is 844 in an effective area of 4.45 arcmin 2 . The nominal magnitude limit used in the HDF-North by A99 
is Iab — 28.5 which provides 926 objects. Beyond Iab ~ 26 the source counts in the HDF-S photometric catalog 
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Figure 2. Comparison of the redshift distributions of the HDF-North (dashed line) and HDF-South (solid line) for galaxies 
brighter than Iab < 26 (upper panel) and galaxies with magnitudes in the range 26 < Iab < 27.5 (lower panel). 

adopted in the present work are systematically higher than the corresponding counts in the HDF-North catalog used 
in A99 by a factor ~ 1.5. The discrepancy is to be ascribed to differences in the approach used to carry out the 
photometry in the two cases (Vanzella et al., 2001). The redshift bins used are the same as those adopted in the 
analysis of A99. 



3.2 Classical ACF computation 

The angular correlation function u>(9) is related to the excess of galaxy pairs in two solid angles separated by the 
angle 9 with respect to a random distribution. The angular separation used for the computation of covers the 
range from 3 arcsec up to 80 arcsec. We use logarithmic bins with steps of Alog(#) = 0.3. The lower limit of 3 arcsec 
is a conservative estimate of the scale over which we are confident about the deblending approach for resolved bright 
spirals or faint galaxy "groups" . The upper cut-off corresponds to almost half the size of the HDF regions and to the 
maximum separation where the ACF provides a reliable signal. 

To derive the ACF in each redshift interval, we used the estimator defined by Landy & Szalay (1993, hereafter 
LS93): 

n (9) - A DD{6) 9 A ° m I 1 m 

where DD is the number of different galaxy pairs, DR is the number of galaxy-random pairs and RR refers to 
random-random pairs with separation between 9 and 9 + A9. The normalisation factors A\ and A2 are given by 

NANr-V) _ Nr - 1 

Al -N g (N 3 -l) < ^-TaT' (2) 

where N g and N r are the total number of objects in the data and random catalogues, respectively. In the present 
work the random catalogues contain N r = 20000 sources covering the same area as our HDF sample. 

In the weak clustering limit, the above estimator has a nearly Poissonian variance (see LS93), so the uncertainty 
is estimated as: 



dujcst{e) = v ¥mw^ ; < rr w>= rr wi a i ( 3 ) 

The results of our analysis will be discussed in Section 3.4, where they will be compared with those obtained by 
the alternative approach, described in the next subsection. 



3.3 Alternative ACF method 
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Figure 3. Left panel: Two examples of redshift probability function for one object with a secondary redshift peak (upper 
panel) and for one object without secondary peak (lower panel). The area below the curves is normalised to unity. Right panel: 
Redshift distributions for galaxies with Iab < 27.5 using the best redshift value for each object (solid line histogram) and by 
summing up the normalized PDFz (dashed line histogram). 



3.3.1 Redshift probability distribution function 

In our previous analysis of HDF-North (A99), we used Monte Carlo simulations to discuss the effects of uncertainties 
in the z p h ot estimates on the clustering results. In particular we used the simulations to obtain the statistical errors 
in each redshift interval according to the limiting magnitude and to define an upper limit to the amplitude of the 
ACF assuming that the contamination effects are due to an uncorrelated population. In the present work, we define 
an alternative method which includes directly in the ACF measurement the redshift probability distribution of each 
object. For each object we measure a redshift probability distribution function (hereafter PDFz) estimated as follows: 



PDF, xoNpj-^) with xLn(z) = J2 



-^obs.i S ' -ftem,i(-^) 



(4) 



where Xmin( z ) is the best fit value obtained at redshift z; F b s ,i is the observed flux; Ftem,i(z) is the template flux 
at redshift z in i-th band, o~i is the photometric error in i-ih band and s is the scaling factor applied to the template 
fluxes as described in A99 (Equation 2). The PDFz is then normalised to unity over the full range used to derive 
the redshift (here < z < 6). 

This PDFz makes it possible to follow the redshift probability for each object (see also Bolzonella, Miralles & 
Pello 2000) and has some similarity with the Bayesian photometric redshift estimation (Bem'tez 2000). To illustrate 
the behaviour of the PDFz, in Figure [| (left panel) we show two examples for one object at z p hot = 3.52 with 
a secondary peak at z p h ot = 0.32 (upper panel) and one at z p hot = 2.56 with no secondary peak (lower panel). In 
Figure ^ (right panel) we also compare the redshift distribution for objects brighter than Iab = 27.5 obtained by 
using the best redshift for the sources (solid line) and by summing the normalized PDFz of all objects (dashed line). 
The spread in the individual PDFz results in a sort of smoothing of the distribution obtained with the best redshift 
estimates. 



3.3.2 Weighted ACF measurement 

In the previous section, we have computed the ACF assuming the best redshift value for each object regardless of 
its confidence level. In this section we take the redshift uncertainty into account directly in the ACF measurement 
by using the PDFz of each object. For all the galaxies within a given redshift interval, we use the PDFz to weight 
the number of pairs according to the probability of the objects being in the redshift bin. In Figure ^ we compare in 
the different redshift intervals the distribution obtained with the best redshift approach and that resulting from the 
summed PDFz approach. The results are summarized in Table [j]. We find that: 

1) The summed PDFz are in general similar to the original distributions and have tails in the neighbouring intervals. 
The enlargement of the distribution corresponds to a change between 5 and 30 per cent. This effect is mainly due to 
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Figure 4. Histograms of photometric redshifts in different redshift bins (specified in each panel) defined by the best redshift 
(dashed lines) as compared with the histograms obtained for the same objects taking into account the redshift probability 
function (solid lines). 



Table 1. Distribution of the PDFz for different redshift intervals. Column 1: redshift bin. Column 2: fraction of objects within 
the redshift range. Column 3: fraction distributed in the adjacent bins. Column 4: fraction in non-adjacent bins. 



z range 


Fraction in 


Fraction in 


Fraction in 




the bin (%) 


adj. bins (%) 


non adj. bins (%) 


0.0 - 0.5 


63 


5.5 


31.5 


0.5 - 1.0 


73.5 


21.5 


5 


1.0 - 1.5 


66 


30 


1 


1.5 - 2.5 


79 


17 


4 


2.5 - 3.5 


78.5 


16.5 


5 


3.5 - 4.5 


74.5 


12 


13.5 



the redshift uncertainties of objects at the boundaries of the bins under consideration. 

2) At redshifts between 0.5 and to 3.5, a very small fraction of objects shows catastrophic secondary redshifts (< 5 
per cent). For the extreme bins the behaviour is different. The 3.5 < z < 4.5 bin shows a pronounced secondary peak 
at low z corresponding to 13.5 per cent of the total. The < z < 0.5 redshift bin shows a long tail between 1 < z < 4 
corresponding to a fraction of 31.5 per cent. 

We find that the fraction of lost objects for different redshift ranges is in good agreement with the Monte Carlo 
simulations carried out in A99, where a gaussian random noise has been added to the original photometric errors 
(see Figure 4 of A99). This shows that the two approachs provide similar results to quantify the photometric redshift 
confidence levels. 

Since the ACF measurements in the HDFs is based on a small sample, we want to optimize the reliability of 
the signal in each redshift bin. The strategy adopted in the following analysis is to include in each redshift bin only 
the objects for which the best redshift belongs to the bin. We call iVdata their number. This allows also a direct 
comparison between the classical ACF and this method. 
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To measure the weighted ACF, the number of pairs in the redshift range z m i n < z < z ma x, entering in equation |y, 
is replaced as follows: 

DD = V P6 l ■ P6 J ■ DR = V Pfe 1 , (5) 

i,j i=l,j = l 

where Pfe 1 represents the integral of PDFz between z m i n and z max for the i-th object. 

The normalisation factors j4,i and A2 are the same except that the total number of objects no is replaced by 

The results are presented in the next subsection. 



3.4 Results 

In this section we discuss the clustering properties of the HDF-South. Figure |E] presents the measurements of the 
ACF in different redshift bins obtained using both methods discussed in the previous subsections. In particular, filled 
circles refer to the classical ACF estimates and open circles to the weighted ACF ones. Note that the errorbars are 
slightly larger for this last method. In fact, in this case the effective number of contributing points no is smaller, as 
the galaxies have a non-vanishing probability outside their bin. 

In order to give a more quantitative estimate of the correlation strength, we fit the data by adopting a power-law 
form for the ACF as ui(6) = A^Q~ & . If the spatial correlation function £ is also assumed to follow a power-law relation, 
i.e. £(r) = (r/ro) ' , the slope 7 is simply related to 8: 7 = 8 + 1. Since the galaxy samples are small, we prefer to 
derive the amplitude A u by fixing the value of 8. As in A99, we adopt 8 — 0.8 but we will discuss this assumption 
later. 

Due to the small size of the considered field, we have to take into account the integral constraint IC (Peebles 
1974) in our fitting procedure as: 

LO cs t — C^truc — IC . (6) 

The quantity IC is defined as the integral of the ACF over the survey, i.e. 

IC = ZU Smax = J J u{e)dQ,itKl* = A u x B , (7) 

where 9 mal is the maximum scale of the survey. The integral B has been computed by a Monte-Carlo method using 
the same geometry as the HDF-South and masking the excluded regions. Adopting the value 8 = 0.8, we derive 
B — 0.033 (for measured in arcsec). 

The fitting power-law relations are all shown in Figure |^, both for the classical ACF (solid lines) and weighted 
ACF (dashed lines), while the values of the amplitude of ui(6) at 10 arcsec are reported in Table |i[ In general we 
find a good agreement between the results of the two different techniques. We find some differences only in the two 
extreme redshift bins ((z) = 0.25 and (z) — 4), where the objects typically display significant tails in the PDFz. Here 
the weighted ACF seems to allow a better extraction of the signal, giving larger values for the correlation function. 
However, due to the large errorbars, the two methods are still consistent at the la level. Finally we note that in the 
redshift bin between 3.5 < z < 4.5 the results are consistent with the assumption of vanishing clustering. 

In order to discuss the effects of the assumed slope on the clustering normalisation, in Table [| we also show the 
amplitudes obtained using 8 = 0.6 and 8 = 0.9. The integral constraints IC have been recomputed according to the 
slope: we find B = 0.074 and B — 0.022 for 8 = 0.6 and 8 — 0.9, respectively. The results show that the impact of 
the changes in the assumed slope affects the values of A u at 10 arcsec by less than la. 

It is important now to compare the clustering properties of HDF-South to the corresponding results for HDF- 
North that we obtained in our previous analysis (A99). The comparison is presented in Figure ^. The left panel shows 
the behaviour of A& computed at 10 arcsec (and multiplied by the bin size Az, for consistency with A99). In spite of 
the smallness of the regions, the amplitudes of the correlation function measured in the two Hubble deep fields are 
in good agreement, showing a small field-to-field variation. The results confirm the behaviour of the clustering with 
the redshift we found in A99. Namely, the clustering amplitude declines from z = to z ~ 1 and increases at higher 
redshifts to become, at z > 2, comparable to or higher than that observed at z ~ 0.25. At z ~ 4 the clustering signal 
measured in HDF-South is very noisy and we cannot confirm the high value of A w found in the northern field. An 
alternative measure of the correlation strength is the comoving correlation length ro- Its redshift evolution, computed, 
as in Magliocchetti & Maddox (1999), assuming a flat universe with present matter density parameter Qom = 0.3, is 
shown in the right panel of Figure |fj. Again, we find a slightly declining or almost constant behaviour up to z ~ 1 
and an increasing trend from z ~ 2 to z ~ 3. 

Since the clustering amplitude of the dark matter decreases continuously with redshift (the actual behaviour 
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Figure 5. The angular correlation functions li)(9) for galaxies with Iab < 27.5 measured for different redshift intervals (as 
specified in each panel). The uncertainties are nearly Poisson errors. The results and the power-law best-fit obtained using 
the classical ACF estimator are shown by filled circles and solid lines, while open circles and dashed lines refer to the results 
obtained with the weighted ACF estimator. 



depending on the cosmological scenario), the observed increase of the galaxy clustering at high redshift implies that 
galaxies at z ~ 3 — 4 are biased tracers of the underlying dark matter. This effect is illustrated in Figure (?], where 
we show the bias parameter b as a function of redshift both for the HDF-South and HDF-North. The values of b 
are computed by dividing the rms galaxy density fluctuation inside a sphere of 8h~ 1 Mpc at a given z (cr| al ) by the 
rms mass density fluctuation (<t™) predicted by linear theory. We consider two cosmological models with a cold dark 
matter (CDM) power spectrum normalised to reproduce the local cluster abundance (Eke, Cole & Frenk 1996): 
an Einstein-de Sitter SCDM model (&s(z = 0) = 0.52 and T = 0.45; left panel) and a flat ACDM model with 
flom = 0.3 and Qoa = 0.7 (cr™(z = 0) = 0.93 and V = 0.21; right panel). The observed bias parameters for the HDF- 
South are in good agreement with our previous results for the northern field. In particular we observe some anti-bias 
(b(z < 1) ~ 0.5) at low redshift, while we confirm that the high-redshift galaxies are strongly biased with respect to the 
dark matter: b(z ~ 3) ~ 3, 2 for the SCDM and ACDM models, respectively. This supports a model of biased galaxy 
formation where b is evolving with redshift. For comparison, in the same plot we also show the theoretical expectations 
for the effective bias (see Matarrese et al. 1997 and Moscardini et al. 1998 for a definition) computed for the same 
cosmological models using different minimum mass for the dark matter haloes (M min = 10 10 , 10 11 , lO 12 h -1 M ). For 
the Einstein-de Sitter model, we can reproduce the observations with a minimum mass M mi „ ~ 10 10 /2T 1 Mq at 2 < 1 
and M m i n ^ 1O 11 /i" 1 M between 1 < z < 3. For the ACDM model M min < 10 10 h~ 1 M Q is required at z < 1 , 
10 10 < Mmin < lO u ft _1 M for 1 < z < 2 and M min > lO 11 /!" 1 ^^ at < z >= 3 are required. 

At redshift z ~ 3, alternative estimates of the galaxy clustering come from the analysis of the Lyman Break 
Galaxy (LBG) samples. We find that the bias measured for the HDF-population is smaller than the one observed for 
the bright LBGs (Steidel et al. 1996). Assuming for example an Einstein-de Sitter model, Adelberger et al. (1998) 
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Table 2. The amplitude of ui(6) at 10 arcsec (A^) for different rcdshift bins. Column 1: rcdshift interval. Column 2: number of 
galaxies with Iab < 27.5 and best photometric rcdshift belonging to the redshift bin. Columns 3 and 5: amplitude computed 
assuming a slope <5 = 0.8 for the classical and weighted ACF estimator, respectively. Column 4: amplitude A^ computed using 
the classical ACF but assuming a different slope (S = 0.6 and <5 = 0.9). 



Classical ACF Weighted ACF 



z range 


Number 


A^(lOarcsec) 


A^lOarcscc) 






Iab < 27.5 


<5 = 0.8 


<5 = 0.6,0.9 


<5 = 0.8 


0.0 - 


0.5 


135 


0.07±0.05 


0.07,0.07 


0.13±0.08 


0.5 - 


1.0 


281 


0.06±0.02 


0.07,0.06 


0.05±0.04 


1.0 - 


1.5 


109 


0.10±0.06 


0.12,0.09 


0.12±0.09 


1.5 - 


2.5 


163 


0.08±0.04 


0.09,0.08 


0.06±0.05 


2.5 - 


3.5 


113 


0.16±0.05 


0.19,0.15 


0.21±0.08 


3.5 - 


4.5 


43 


0.01±0.15 


0.03,0.00 


0.09±0.23 




z z 

Figure 6. Comparison of the clustering properties of galaxies in HDF-North and South. Left panel: the redshift evolution of 
the ACF amplitude A w at 10 arcsec (multiplied by the bin size Az). Open triangles refer to the values of the HDF-North 
obtained by A99, while filled and open circles refer to the results obtained in this work adopting the classical and the weighted 
estimators, respectively. The different measurements have been shifted by z = (z) + / — 0.05 for clarity. Right panel: the redshift 
evolution of the comoving correlation length ro(z) (in h~ 1 Mpc) as computed by assuming a flat universe with Qom = 0.3. The 
meaning of different symbols is the same as in the left panel. 

found for the spectroscopic sample of LBGs a bias parameter b(z = 3) ~ 6 while Giavalisco et al. (1998) found 
b(z = 3) « 4.5 from the photometric sample (see also Giavalisco & Dickinson 2001). Averaging our results for HDF- 
South and HDF-North, we find b(z = 3) » 2.8. These differences can be explained by the different surface galaxy 
densities (larger in the HDF fields, which have approximately 30 objects per square arcmin). In fact in the hierarchical 
galaxy formation scenario, more massive and rare objects form in rarer and higher peaks of the underlying matter 
density field; as a consequence they are expected to have a higher value of the bias parameter. 



4 THE ERROR BUDGET 

Up to now, we have assumed in our measurements that the errors are nearly Poissonian. We have neglected other 
possible contributions to the errors, due to the finite area of the survey or to the clustered nature of the galaxy 
distribution. In this section we deal with these effects basing our analysis on analytical expressions of the cosmic 
errors calculated by Bernstein (1994). We estimate the relative magnitude of various contributions to the errors and 
show that in fact our nearly Poissonian errorbars are consistent. We also examine the possible improvements brought 
by a survey made with the Advanced Camera on HST. 
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Figure 7. The measured bias b as a function of redshift for an Einstein-de Sitter SCDM model and ACDM model (left and 
right panels). The open triangles refer to the values obtained for the HDF-North in A99; filled and open circles refer to the 
values obtained in this work for the HDF-South using the classical ACF and the weighted ACF estimators, respectively. The 
different lines represent the theoretical effective bias computed for the same cosmological models assuming different values of 



minimum mass M u 
ft- 1 M Q . 



We show results for M„ 



10 (solid lines), 10 (short-dashed lines) and 10 (long-dashed lines) 



4.1 Analytic expression for cosmic uncertainties 

Originally, LS93 have derived the variance of their estimator by assuming the weak correlation limit but neglecting 
the contribution of the higher-order correlation functions. The computation has been generalized by Bernstein (1994, 
hereafter B94; see also Hamilton 1993; Szapudi 2000) for any clustering regime taking into account higher-order 
correlation functions but neglecting edge effects. The Bernstein's equation is obtained in the case of a (degenerate) 
hierarchical model and has been rewritten as follows: 



4(1- 2q z + qt 



N 



uj(6) 2 
uj r (6){l + 2q 3 uj{&)) 



M6») -Ue max ) 2 + 



+ 93-1 



u>(ey 



[2(1-2 ?3 M0) - 2q 3 io r (e)+oJe Ta ^(3q3'l)-l] 



(r for 1 n l±i^g) _ i _ 1 - , 



- 1 - LUe n 



(£3) 



In this equation, valid in the regime 6/6 mBX , i^9 max , l/N g <C 1, N g is the number of galaxies in our sample. The 
function u r (6) is the average of the two-point correlation over a shell corresponding to angles in the bin [6, 6 + 56]. 
In a first approximation, ui r (8) ~ lj(6) (see B94). The term G p (6) is the probability of finding two randomly placed 
galaxies with separation in the range [6,6 + 86]: 

G P (6)=<RR(6)> /lN g (N g -l)/2]. (9) 

The parameters qs, Q4 are related to the hierarchical amplitudes of the cumulants of the dark matter distribution 
S3, S4 (Sn = > /M^ -1 , where lon are the iV-point angular correlation functions, and ujn corresponds to their 
integral average over a disk of radius 6) by 53 ~ £3/ 3 and q& ~ S4/I6. 

Equation (IS) is composed of three terms, which we call Ei, E2 and E3. 
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The first contribution to the errors, E\, hereafter referred as the finite volume erroiji] (e.g. Szapudi & Colombi 
1996), does not depend on the number of galaxies in the catalogue. It comes from the finiteness of the area covered 
by the survey. In a first approximation, this is proportional to the average of the two-point correlation function over 
the survey area, ZJ9 max [see equation (Q)]. 

The second (E2) and third (E3) terms reflect the discrete nature of the catalogue. They account for random 
fluctuations of the galaxy distribution as a local Poisson realization of a continuous underlying field (e.g. Szapudi & 
Colombi 1996). The term E2, proportional to 1/N g , appears only in correlated sets of points (see B94): it cancels in 
the Poisson limit, u) — > 0. The pure Poisson error is in fact contained in the next order term, E3, proportional to 
1/Ng. Hereafter, E2 and E3 will be referred to as the discreteness errors. Note that discreteness and finite volume 
effects can be disentangled only approximately: there are terms proportional to Z<Je max in E2 and E3. They correspond 
to hybrid, "finite-discreteness" effects. However these latter give only very little contribution to E2 and E3 and can 
in fact be neglected in most realistic situations. Finally, the error estimate of B94 neglects edge effects which become 
significant at the largest angular scales. The advantage of the LS estimator is to reduce these latter as much as 
possible, and therefore equation (^) is expected to give a good estimate of the cosmic errors even in this regime, 
although it might slightly underestimate them. 



4.2 Assumptions used to compute the cosmic errors 

From equation (0), one can see that the calculation of the cosmic error for ui(8) requires prior knowledge of statistics 
up to order four, in particular lo{6) itself, Z^e max , 93 and 54. To estimate them, we proceed as follows. 

• The value of lu(9) taken in equation (^) is estimated from the best fits, Wfu, obtained in Figure |B|; 



• The calculation of wj maJ is done as explained at the end of § 3.2. Note that computing the integral constraint 
in such a way, by assuming a power-law behavior for the two-point correlation function in all the regimes, might in 
turn lead to overestimating ZZ>9 max . Indeed lo(6) is expected to present a cut-off at large scales, at least if low-z results 
(such as measurements of uj(0) in the APM; e.g. Maddox et al. 1990) can be extrapolated to higher redshifts. 

• The choice of 53 and 54 is more delicate: these parameters cannot be inferred from self-consistent measurements 
in the catalogues analysed in this paper and in A99. Indeed, higher-order statistics are more sensitive to cosmic errors 
than the two-point correlation function, with an error which increases with the order considered. Therefore it would 
be impossible to extract reliable values of 93 and 54 from these catalogues mainly contaminated by shot-noise, even 
with strong prior assumptions such as assuming a power-law behaviour for higher-order correlation functions similarly 
as we did for u>(8). Instead, we use measurements of S3 = 3q3 and S4 = 16 54 obtained in the local universe (2 = 0) 
by Gaztanaga (1994) with the APM catalogue (Maddox et al. 1990) at 6 ~ 0.1°: S 3 (z = 0) ~ 4 and S 4 (z = 0) ~ 50. 
At the level of approximation used in this paper, we can neglect a possible dependence of 5*3 and 5*4 on the angular 
scale. However, evolution with redshift of these quantities might be important, particularly if the bias between the 
galaxy and the dark matter distributions increases significantly with redshift, as suggested by the measurements in 
this paper. Both theoretical calculations based on perturbation theory (e.g. Juszkiewicz, Bouchet & Colombi 1993; 
Bernardeau 1994) and measurements in N-body simulations (e.g. Colombi, Bouchet & Hernquist 1996; Szapudi et 
al. 1999) show that the parameters S3 and S4 measured in the dark matter distribution do not evolve significantly 
with time, at least at the level of approximation of this paper. However, the bias can strongly affect higher-order 
statistics: in general, increasing the bias factor b reduces the values of S3 and S4 compared to what is obtained in 
the dark matter distribution. Here, following Colombi et al. (2000), we adopt two simple, extreme models. The first 
one consists in assuming that the effect of biasing is negligible: Sn{z) = Sn(z = 0), that we refer to as the no bias 
model. The second one is motivated by observational results (Szapudi et al. 2001) and to some extent by theoretical 
calculations (Bernardeau & Schaeffer 1992; 1999): S N {z) = S N {z = 0)/[b(z)] 2<N ~ 2) . For the values of b(z), we take 
the ACF measurements in the HDF-South obtained with the classical approach (unless otherwise specified) as shown 
for each cosmology in Figure [?] (filled circles) - i.e. we assume that the APM galaxies are unbiased with respect to 
the dark matter distribution and that the HDF galaxies are biased with respect to the APM ones with bias equal to 
b(z). These models are referred to as the SCDM and AC DM bias models. 



4.3 The cosmic errors in the HDF fields 



In Figure p, we compare the magnitudes of the finite volume error, E x , the discreteness errors, E 2 and E 3 [e.g. 
equation (fc )], and the total error Suj/ujat = E 1 ^ 2 — (E\ + E2 + E3) 1 / 2 , at different angular separations with the errors 
used for the classical ACF measurement derived from equation (J3f) - The different panels show the relative errors for 



t also often called 



cosmic variance 
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the different redshift ranges as in Figure |B|. We show here the errors obtained from the analytical expressions using 
the SCDM bias model {e.g. b(z) obtained from the left panel of Figure (jj). 

As expected, the estimates of the errors used for the classical ACF [equation (^)] match quite well with the E3 
term of equation ^ (long-dashed lines). Because the sample is quite sparse, we have E3 ^ i?2, but E2 (short-dashed 
lines) is not negligible, except at the largest angular separation. The finite volume error (E-l term, dotted lines) plays 
an important role as well, especially at low z, where the effective size of the survey is small, and at large angular scales. 
Note that the results obtained at the largest scales have to be interpreted with caution since equation (Q, which 
assumes 6 small compared to the survey size, might be slightly outside its domain of validity. The total theoretical 
cosmic error (solid line) depends weakly on the scale and assumes its largest values at low and high redshifts: in the 
first case because of the finite volume effects, in the second case because of the Poisson noise. 

Figure ^| is similar to Figure but shows the dependence on redshift of the errors at a fixed angular scale, 
6 = 10" (this choice being arbitrary). Here, we consider various bias models: no bias (left panel), the SCDM bias 
model (middle panel) and the AC DM bias model (right panel) . Since the theoretical expression (equation |^) is now 
compared to the equation |§] for both analyses of the HDF-South (filled circles) and the HDF-North (open squares, 
from A99), for any survey-dependent quantity in equation (^) (namely N g , cv^t(9) or function b(z)), we take the result 
obtained from the average between the two fields. 

Again, the overall agreement between the E3 term and the errors estimated from equation ^| is pretty good, as 
expected. In most cases, the term E3 dominates the total error at this angular scale, except at low z for the bias 
models, where the finite volume error dominates. Indeed, as a result of our rather extreme modeling of the effect of 
the bias on higher-order statistics, S3 oc b~ 2 and S4 cx 6~ 4 (§ 4.2), the effects of changing b(z) can be important 
on the finite volume errors, especially if b(z) < 1. This is the case at low redshifts in the HDF population both for 
SCDM, where b(z = 0) ~ 0.7, and ACDM, where b(z = 0) ~ 0.4 (e.g. Figure |). 

The results presented in Figures ^ and ^ show that the total theoretical cosmic error given by equation ^ can be 
significantly larger than the estimate given by equation (^). This might sound surprising, because the measured values 
of u) are rather small, ui(8) 0.4 (e.g Figure |E]): thus one might argue that the weak clustering regime approximation 
(^) should be valid to estimate the errors. In practice, we see that this assumption is incorrect, particularly at low z, 
at least in the examples examined here. However, the amplitude of E^^iff) is at most ~ twice larger than the error 
given by equation (^) and shows the same global shape. Furthermore, as mentioned in § 4.2, we max is likely to be 



overestimated with the method we use, which might artificially increase the observed difference between equation (|3 
and equation (^). Finally, one has to be aware of the fact there is a subtle difference between the calculations of 
LS93, which lead to equation ^ and those of Bernstein, which lead to equation (^). In the first case, the authors 
considered a conditional statistical average, using the supplementary information that the number of objects in the 
catalogue N g is known. In the second case, the author did not use such information, which naturally leads to slightly 
larger errors, since N g is not conditionally fixed and can fluctuate. 

Given the level of approximation used in this paper, it is thus fair to conclude that the weak clustering approx- 
imation is good enough, which confirms a posteriori the validity of the approach used in A99 and up to § ^| in this 
work, to compute errors. 

In order to quantify how the Advanced Camera on the HST (Pirzkal et al. 2001) can improve the clustering 
measurements of HDF-like populations, we have estimated the analytical behaviour of the cosmic errors with redshift. 
The results are shown for 9 = 10" in Figure |l(j, which can be directly compared with Figure ^. To estimate the 
cosmic errors in Figure [l^, we have rescaled the mean observed number of galaxies in the HDF-South and North to 
the respective area of the Advanced Camera (i.e. by a factor 5.33) and recomputed the term we max according to the 
new area, assuming a square geometry. Other quantities, in particular b(z) and the amplitude o;(10") are the same as 
in Figure ^. Note again that the method we use to calculate wg max is likely to overestimate its real value and therefore 
finite volume effects. 

Finite volume errors become smaller due to the larger area covered, while discreteness effects are reduced due 
to the larger number of objects in the survey. Except at small z, where the effect of the bias can make E\ dominant 
again, the sizes of E\ and E2 are of same order, and E3, which contains the pure Poisson noise, is now negligible. 
Thus, a survey made with the Advanced Camera will no longer be dominated by shot-noise. In terms of sampling 
strategy, we find that this kind of survey will be a good compromise between finite volume effects and discreteness 
effects (e.g. Colombi, Szapudi & Szalay 1998), with a gain of more than a factor two for the total cosmic errors 
compared to the present data, at least at the scale considered here. 



14 Arnouts et al. 



Analytical Errors with SCDM bias model 
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Figure 8. Comparison between the nearly Poissonian errorbars [equation (|3|), filled circles] used in the computation of u>{9) 
(Classical ACF) at different angular separations with the analytical errors of equation finite volume error E^ 2 (dotted line); 
discreteness errors E^ 2 (short-dashed line), E^ 2 (long-dashed line) and tota l cq smic error E 1 / 2 = (Ex +E 2 + E 3 ) 1 / 2 (solid 



line). The analytical errors are computed using the SCDM bias model (see § | 
to the different redshift ranges as in Figure ^| 
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Figure 9. Comparison of the nearly Poissonian errorbars (equation (J3|) at 10 arcsec, estimated for the HDF-South (filled 
circles) and the HDF-North (open squares) with the cosmic errors from equation (^) (total: solid lines; Ey. dotted lines; E2: 
short-dashed lines; E3: long-dashed lines). The left, middle and right panels refer to the no bias, the SCDM bias and the 
ACDM bias models, respectively (see text). There is one filled circle missing on each panel at z = 4, which corresponds to a 
very large Poisson errorbar Slj /cjg t ~ 23 (see the right bottom panel of Figure H) . 
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Figure 10. As Figure g, but in the case of a survey with the Advanced Camera (see text). Only the theoretical cosmic errors 
from equation (ph are displayed. 



5 CONCLUSIONS 

In this paper, we have described the measurement of the galaxy clustering as a function of z in the HDF-South 
based on a combination of optical HST data and VLT/ISAAC infrared data. The main results can be summarized 
as follows: 

• The redshift distribution obtained for the HDF-South up to Iab ~ 27.5 is consistent with that observed for the 
HDF-North. The peak of N(z) is close to z ~ 0.8 with a decrease at z ~ 1, a plateau from 1 < z < 3, followed by a 
decline of the number of objects up to z ~ 4.5. 

• We have described an alternative approach to include photometric redshift uncertainties in the ACF measure- 
ments. The method is based on a weighted measurement of the ACF taking into account the redshift probability 
distribution of each object. This method makes it possible to extract the clustering signal with higher significance. 
It will be interesting to implement and to test the method also for ground-based data which typically have larger 
photometric errors than HST data, and for less reliable photometric redshift, i.e. with a larger spread of the redshift 
probability distribution. This approach can be extended to any kind of evolutionary studies based on photometric 
redshifts, like, for example, the luminosity function. 

• We have compared the results of the clustering evolution obtained in the HDF-North and HDF-South. Both 
are fully consistent within the Poissonian uncertainties. The new observations confirm our previous findings for the 
HDF-North (A99). The clustering amplitude shows a decrease between < z < 1 and an increase at z > 1.5. 
The redshift range 1 < z < 2 seems to be a critical epoch where the HDF-galaxy clustering reaches a constant 
regime still difficult to characterize, due to the smallness of the present sample and to the critical redshift range 
for photometric redshift determination. Larger samples with the HST Advanced Camera will improve significantly 
the present picture. The comparison with the behaviour of the underlying dark matter shows that the HDF-galaxy 
population is a nearly unbiased or anti-biased tracer of the dark matter distribution at z < 1 and z < 1.5 in SCDM 
and ACDM models, respectively. At higher redshift the clustering amplitude increases and the bias of this population 
too. At (z) ~ 3, the bias is b ~ 3 and b ~ 2 for SCDM and ACDM models, respectively. This is in good agreement 
with the results we obtained for HDF-North (see also Magliocchetti & Maddox 1999). The typical minimum masses 
of the hosting dark matter haloes required to reproduce the observations in SCDM model are Af m i n = 10 10 Mq 
atz< 1.5 and M min ~ 1O 11 /i _1 M for 1.5 < z < 3.5 (M min < 10 10 /i _:L M Q at z < 1.5 and M min ~ W 11 ' 11 - 5 h' 1 M Q 
for 1.5 < 2 < 3.5 in ACDM model). At (z) ~ 4, the clustering signal detected in the HDF-South is considerably 
smaller than the corresponding amplitude observed in the northern field, but, due to the very small sample (and, as a 
consequence, a large Poisson noise), the two results are still consistent within la. Again, larger samples are required 
at such a redshift. 

• In all our analysis we used errorbars assuming nearly Poisson statistics (w(6) <C 1), as given by Landy & Szalay 
(1993). To check a posteriori that such a procedure is valid, we used the analytical approach of Bernstein (1994), 
which fully describes the global budget of cosmic errors. In particular, the formulae obtained by Bernstein (1994) 
do not assume uj(0) <C 1 and take into account effects of higher-order statistics. We checked that we recover the 
nearly-Poissonian contribution in Bernstein's calculations, and we found that it is indeed dominant in most regimes, 
except at small redshifts and at large angular scales, where the finite volume error (often called cosmic variance) can 
become significant. Note that Bernstein's calculations neglect the edge effects, which can contribute to the errors (e.g. 
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Szapudi & Colombi 1996). However, by construction, the Landy & Szalay estimator, that we used in our analysis, 
should minimize them to a large extent. 

As a general conclusion of this paper, the HDF samples allowed us to obtain a global picture of the redshift 
evolution of the galaxy clustering, but with errorbars dominated by Poisson noise. Future instruments, like the 
Advanced Camera, will improve the accuracy of the measurement of uj(8) by at least a factor two, mainly by reducing 
discreteness errors. In particular, pure Poisson noise will become subdominant and it will no longer be possible to 
neglect finite volume effects in the analyses. 
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